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THE SPATIAL RESOLVING POWER OF EARTH RESOURCES SATELLITES: 

A REVIEW 


1. Introduction 

Earth resources satellites which are currently being constructed or designed for launching in 
the 1 980’s will provide images with substantially better spatial resolution than those commonly 
available from civilian satellites during the last decade. In the 1970’s the very large majority of 
satellite images of the Earth’s surface suitable for the study of terrain properties were obtained 
from the Landsat series of satellites. Many Earth scientists were quick to exploit the potential 
of this new source of environmental information (e.g. NASA 1973a; NASA 1975), but others found 
it wanting. Not least, this was caused by the lack of ground detail which the imagery displays 
(i.e. its low spatial resolution) compared with that in conventional air photographs. 

Operational use of such data has similarly been hindered by their relatively poor resolution. 
Figure 1 shows the results of a recent survey of the needs of the major U.S. government agencies 
for remote sensing data in terms of spatial resolution. Given the current resolution of the Landsat 
MSS of 79m it is clear that the needs of many users are not apparently being met. 

These perceived needs for higher resolution imagery will be partially satisfied by the better 
resolving capabilities of Landsat D scheduled to be launched in 1982 (CORSPERS, 1976; Salomon- 
son, 1978; Salomonson et al. 1980) and of SPOT (Systeme Probatoire d’Observation de la Terre) 
to be launched in 1 984 (Gaubert, 1978). Other high resolution systems that have been proposed 
include Mapsat (Colvocoresses, 1979a), Stereosat with sensors looking fore and aft to produce 
stereoscopic coverage, (Jet Propulsion Laboratory, 1979) a large format camera on Spacelab (Doyle, 
1979) and a Synchronous Earth Observation Satellite (Doyle, 1978). Additionally there are classi- 
fied surveillance systems such as the KH-1 ! of the U.S. with very high resolving powers, though 
the resultant imagery is not widely circulated. 
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U.S. GOVERNMENT AGENCY USER REQUIREMENTS 
FOR SPATIAL RESOLUTION OF REMOTELY SENSED 
DATA FROM SPACE 
(INTER-AGENCY TASK FORCE. 1979) 



GROUND RESOLUTION REQUIREMENTS 
(IFOV IN METRES) 


Figure 1. Survey of spatial resolution requirements of major US government 
agencies. (Inter-Agency Task Force, 1979) 


In practice the capabilities of such future systems, and even current ones, are poorly compre- 
hended by many earth scientists including geographers. In large part this arises because users do 
not properly understand the significance of resolution figures which are quoted. The most com- 
monly available measure is the instantaneous field of view (IFOV), v/hich for many applications 
significantly over-estimates the capabilities of sensing systems. By borrowing from such disciplines 
as electrical engineering, optics, and photographic science other measures of spatial resolving power 
of remote sensing systems have been derived which are often more informative; these are reviewed 
in the next section. We can obtain a more practical insight into the benefits of improved spatial 
resolution by examining the results of several empirical simulation experiments of future satellite 
systems in Section 3. Finally it is important to recognize factors other than spatial resolution 
which strongly affect the detail of information extractable from remote sensing imagery (Sec- 
tion 4). 


A. 



Our attention is restricted to the visible and near infrared parts of the spectrum, since this 
is the source of nearly all higher resolution imagery that has been obtained so far, and extensive 
collection of data outside of these wave bands at high resolutions is unlikely to take place until 
the late 1 980’s or early 1990’s. 

2. Concepts of Spatial Resolution 

Spatial resolution refers to the fineness of detail depicted in an image that is it describes the 
minimum size of objects on the ground that can be separately distinguished or measured. Trans- 
forming this apparently straightforward concept into an operational quantitative measure has 
proven far from simple. There is no single satisfactory measure of spatial resolution available, nor 
can values obtained from one method necessarily be readily converted to those derived by another. 
Spatial resolution turns out to be a much more complex topic than our initial intuitive definition 
suggests. 

Recently it has been suggested (Forshaw et al., 1980) that definitions of spatial resolution can 
conveniently be assigned to one of four categories; geometrical properties of the imaging system, 
the ability to distinguish between point targets, the ability to measure the periodicity of repetitive 
targets and the ability to measure the spectral properties of small finite targets. Examples of these 
will be reviewed in turn and their relative merits discussed. 

2.1 Measures based on the geometric properties of imaging systems 

The only measure that needs to be considered in this category is the instantaneous field of 
view (IFOV). This is probably the simplest measure of spatial resolution available and is in one 
respect the most important, since it is the most widely quoted for satellite systems. It is usually 
calculated as follows (see Fig. 2): 

Hd 

IFOV = 

i 

where H is the satellite orbit height 

d is the detector size 
f is the focal length of the optical system 






GEOMETRIC 

INSTANTANEOUS 
FIELD OF VIEW. 

(IN PRACTICE f ^ H) 



Figure 2. Definitions of instantaneous field of view (IFOV). 
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Since detector size has to be defined, this measure is most applicable to sensors with dis- 
crete detectors such as line-scanners (Lansing and Cline, 1975) or push-broom radiometers (Thomp- 
son, 1979; Wight, 1979) (Figure 3). In the former, images are built up across the track of the satel- 
lite by the movement of a mirror and along track by the forward movement of the satellite. Thus 
the final image is comprised of a matrix of picture elements or pixels. This approach was used in the 
multispectral scanner system (MSS) of Landsats 1, 2, and 3 and will be used in the Thematic Map- 
per of Landsat D (see footnote 6 in Table 1). In push-broom radiometers, the need for a moving 
mirror for the across track elements is dispensed with. On a single monolithic chip of silicon, 
hundreds to over a thousand detectors can be manufactured along with amplifiers and electronic 
multiplexing circuits (Thompson 1979): these detectors are electronically sampled such that the 
entire linear array is read out in the time to advance along track by one resolution element. The 
push-broom configuration is being adopted in the French SPOT mission and probably will be 
adopted in the U.S. operational earth resources satellite. 

For the MSS of the Landsat series, the IFOV is normally quoted as 79m, (except for the 
thermal channel of Landsat 3 which is 237m). Colvocoresses ( 1 979b) and Slater ( 1 979) point out 
that the detector size is reduced due to cladding (walls and adhesives) around the fibre optics through 
which the photons pass to reach the detectors. This results is an IFOV of 73.4m according to the 
former and 76.2m according to the latter. Moreover since the altitude of Landsats have varied 
from 880 to 940 kms, the IFOV has varied from 76m to 81m (Gordon 1980) ignoring the effects 
of cladding. 

The IFOV does not necessarily give the minimum size of objects that are detectable. An ob- 
ject smaller than this size may be sufficiently brighter or darker than its surroundings to change the 
overall radiance of the pixel, so that it is detectable. Thus roads and rivers narrower than 79m are 
frequently detectable on Landsat MSS images. The alignment of a linear object is also crucial. 

Its chances of detection will depend strongly on whether its central axis falls along the centre of a 
scan line or falls along the boundary between two scan lines (Gurney 1980). In the latter case, 
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Table 1. Principal civilian satellites (past, present and future) for regular collection 
and dissemination of terrestrial Earth resources data 
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the likelihood of its being detected wiH clearly be lower. Whereas objects less than 79m across may 
be detected; many objects equal to or greater than this value will not be detected. Commonly in 
Landsat MSS imagery it has been found that medium to low contrast objects are detectable only if 
they are 250m across or more. One immediately obvious explanation for this is that our ability to 
detect an object depends on its contrast with its surroundings in relation to the sensor’s ability to 
detect small differences in radiance. This is measured by the signal-to-noise ratio (S/N), which is 
simply the ratio of the signal to the total noise present. Two types of noise are present in all detec- 
tors, namely the photon (or shot) noise due to random fluctuations Of photons striking the detector 
and the Johnson (or Nyquist) noise as a result of thermal effects in the sensor. Various other types 
of noise exist specific to the particular sensor (Baker and Scott, 1975). Figure 4 illustrates the 
character of the S/N ratio. Signals comparable to or smaller than the combined noise level will be 
undetectable. Moreover the signal from sensors will usually be quantized for transmission to the 
ground surface (though not for the RBV sensor of Landsat 3), and for subsequent digital analysis. 
This digitization will further add noise to the signal. The impact of different quantization levels 
on the use of remotely sensed data has recently been examined by Tucker (1979). Apart from 
hardware noise we can consider atmospheric effects as a noise-like phenomon which can increase or 
decrease the signal received by the sensor (Fraser, 1974; Slater, 1975a). The net effect is to reduce 
the contrast of an image. Thus, the minimum size of objects which can be detected in a given scene 
will be a function of local atmospheric conditions. Wiersma and Landgrebe (1978) also refer to 
scene noise, caused by those components of a scene reducing the accuracy with which pixels are 
assigned to the correct land cover class. For example soil variation may confuse multi-spectral 
classification of crop types. The significance of this point is dealt with further in Section 3. 

Various blurring effects will also be present and lead to further image degradation (Forshaw 
et al., 1980) and thus will contribute to the disparity between the estimate of resolution based on 
the IFOV and the realizable resolution in terms of the minimum size of objects that can be dis- 
tinguished. These are due to optic phenomena such as aberration and diffraction. The former is 
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much more important than the latter at the wavelengths in the visible and near infrared but at 
longer wavelengths in the microwave, the reverse is the case. Blurring will also be caused by motion 
during imaging due both to the forward movement of the satellite and to the across-track movement 
of the mirror in scanning systems. If a multispectral image is produced, the extent to which the 
images are not geometrically registered will also produce a blurring effect. 

It is usual to equate pixel size in imagery with the IFOV, but this need not be the case. For 
Landsats 1 and 2 MSS data the IFOV and the pixel size are indeed essentially the same. In the 
along track direction, the pixel size equals the ground track velocity divided by the mirror scan 
frequency but across track the continuous signal could in theory be sampled at any arbitrary rate. 

In fact it is chosen to give a pixel width the same as the along track pixel size but with an overlap 
between adjacent pixels of 23m. Hence pixel size for the Landsat MSS is sometimes quoted as 79m 
X 56m (General Electric, undated). The data on Landsat 3 digital tapes have been resampled using 
a cubic convolution algorithm to produce pixels with dimensions of 57 X 57m (Holkenbrink, 1978) 
in which case pixel size and IFOV are dissimilar. However this resampling which produces smaller 
pixels will not improve the actual spatial resolution and may decrease it slightly. 

The previous discussion demonstrates the limitations of the IFOV as a measure of spatial re- 
solution. In the following sections, we discuss various measures which attempt to take these factors 
into account in estimating spatial resolution. However, first, we note than an alternative definition 
of IFOV has been proposed, based on the point spread function (PSF) of an imaging system. The 
point spread function describes the distribution of energy in the image plane, when a point source 
is imaged. In other words it describes the resultant image of a point source. This image is never a 
simple point due to such factors as the motions of the spacecraft and imaging mirror lens’ properties 
as well as atmospheric influences. The point spread function IFOV is defined as the width of the 
point spread function at its half amplitude values (figure 2). The IFOV of 30 metres normally 
quoted for the Theomatic Mapper of Landsat-D is based on this measure; the corresponding point 
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spread IFOV, for the Landsat MSS is somewhat greater than that of the geometric IFOV namely 
90m rather than 79m (Landgrebe et al., 1977). The point spread instantaneous field of view is in 
fact closely related to measures dealt with in the following section. 

2.2 Measures based on the ability to distinguish between point targets 

The most commonly used definition in this category of resolution measures is the Rayleigh 
criterion (Perrin, 1966; Slater, 1975a). Even if a lens is completely aberration-free, the resultant 
image of a point-source, will not itself be a point, but will consist of a central bright disk surround- 
ed by faint dark and light rings. This pattern is known as the Airy pattern, and arises because of 
diffraction effects: this pattern describes one particular type of point spread function (see section 
2.1). The Rayleigh criterion for distinguishing between two targets is based on two equal intensity 
point sources imaged by an unobstructed aberration-free, circular aperture. It states that the two 
targets will be just resolved if the central peak of the image of one source lies on the first dark 
ring of the second. The angular separation (0) is simply calculated as (Slater, 1975a): 

1.22 Xf 

e = 

1000 

where X is the wavelength in micrometers, 
f is the usual f-number of the lens. 

From d and the height of the sensor above the ground, a measure of resolution in terms of 
ground measures can be derived. 

Most remote sensing targets are of course not point sources, and with this in mind Otterman 
(1969) derived a resolution measure for extended circular sources which is more relevant to remote 
sensing. Estimates for square or rectangular objects would probably not be very different. He 
showed that the diffraction limited resolution for such sources is almost six times coarser than that 
for point sources. 
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In practice of course, lenses are not aberration-free and as discussed in the previous section 
there are many other properties which will degrade the image and hence increase the minimum 
separation that is detectable. Measures of this category consequently gives an indication of the 
absolute resolution that is achievable by a lens. 

2.3 Measures based on the ability to measure the periodicity of repetitive targets 

Measures based this property arose principally from work on photographic images (Scott and 
Brock, 1973; Shaw, 1979) though they have been applied to images derived from other sensors 
(e.g. Lavin and Quick, 1974; Buchtemann, 1974). They are based on the observation that if one 
images sets of parallel linear objects, the contrast between them and their background will appear 
10 be lower as their spacing decreases, until a step is reached when the contrast is so small that the 
linear objects are indistinguishable. Values of resolution derived in this way are consequently ex- 
pressed by spatial frequency measures such as line pairs/mm confusingly often abbreviated to 
lines/m.m. Since the linear targets used often have a sinusoidally-varying tone, values are also ex- 
pressed in cycles/mm. Somewhat against standard practice NASA (1973b) has expressed such 
values in terms of single lines/mm or in terms of half-cycles/mm, an approach which has not found 
favour by all (Slater, 1975b). 

Modulation (M) is the measure of contrast most frequently used in this context. 

E “F . 

_ ^max ^mn 

Emax 

E is variously defined as object radiance (Welch, 1977; Smith, 1 978), luminance or photographic 
exposure values determined from microdensitometer readings (Perrin, 1966; Welch, 1971), trans- 
mittance or intensity (Scott and Brock, 1973). M consequently has a maximum value of 1.0. The 
ratio of the image modulation Mj to the object modulation Mq is known as the modulation trans- 
fer factor. If we plot the transfer factor against spatial frequency the resultant curve obtained is 
called the modulation transfer function (MTF) (Steiner and Salerno, 1975) (Figure 5). Usually the 
MTF curve is derived only for high contrast sinusoidal targets but can also be derived for square- 
wave targets. 
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Figure 4. Signal to noise ratio (S/N). 




ancus ii.casures Oi resolution can be derived from MTF curves. For example, vvc can calcu- 
n»i.e the spatial frequency at which the modulation falls to a set proportion of its maximum value. 
The effective instantaneous field of view (EIFOV) is half the value of the spatial frequency for 
which the modulation of an object with a sinusoidal distribution of radiance has dropped to 50% 
of its original value as a result of the modulation transfer function of the imaging system (Slater 
1975b; NASA 1973b) (fig. 5). The ground measurement G in metres is derived from the estimate 
of line pairs per millimetre (L) displayed on the actual images by this equation: 

1 


G = 


lOOOSL 


where S is the scale of the imagery. The EIFOV of the Landsat MSS (using the 0.5 MTF value) is 
66m, which is rather smaller than the IFOV of 79m (Welch 1977). If the full cycle definition is 
used, the value is 132m. Using the MTF curve for the return beam vidicon camera given in General 
Electric (undated), an estimated EIFOV of 38.8m is obtained. The IFOV of the Thematic Mapper 
of 30m corresponds to a modulation transfer factor of 0.35 for a square wave response (Blanchard 
and Weinstein, 1980). From data quoted in Morgenstem et. al. (1976) it appears that the EIFOV 
is approximately 45 meters. An alternative approach is to derive a demand modulation curve (also 
called the threshold modulation or aerial image modulation curve) which is a plot of the minimum 
image modulation required to produce a response in a sensor as a function of spatial frequency. 
Thus when plotted on a graph of the modulation transfer factor against spatial frequency (Figure 
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EXAMPLE OF A MODULATION TRANSFER CURVE FOR LENS 
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Figure 5. Modulation transfer functions of the Landsat MSS and a typical lens (L), with a threshold modulation (TM) cUrve for 
film. Intersection of the last two curves indicates the finest resolution possible with this combination of lens and film. 
Derivation of the effective instantaneous field of view (EIFOV) is shown for the Landsat MSS. 




5), the curve plots upwards from left to right. The intersection of this curve with the system MTF 
gives the limiting resolution of the system. This approach has been applied most commonly to 
photographic film (Welch 1972; Smith, 1978) when the curves are usually called threshold modula- 
tion (TM) curves. Since MTF curves are almost invariably non-linear, use of a single value can be 
misleading. A more comprehensive estimate is to calculate the equivalent bandwidth, which is 
obtained by replacing the MTF curve by a rectangle of equivalent area and giving the upper bound 
as the value of limiting resolution, though even this will be a simplification (Smith, 1978). A further 
refinement with reference to visual interpretation is to calculate the modulation transfer function 
area (MTFA) 

MTFA = /[M(k)-D(k)] dk 

where M(k) is the imagery system MTF and D(k) is the detection threshold curve for the human 
visual system (Schindler, 1 979). Despite the apparently self-evident relevance of such measures 
for assessing the usefulness of space imagery for photo-interpreters this approach does not appear 
to have been considered. 

MTF curves can be derived by imaging bar targets of varying width in the laboratory, but are 
preferably derived from large bar targets on the ground. Since standard resolution targets of suffi- 
cient size are unavailable for space imagery, MTF curves have been derived from edge (Corbett, 

1974) or line targets (e.g. Welch, 1 974) such as coast lines or roads, and by using intermediate 
images from aircraft flying under the orbiting sensor (e.g. Schowengerdt, 1976). 

Where a sensor does not possess discrete detectors the IFOV cannot be directly calculated 
and estimates of resolution based on MTF are especially appropriate. This is the case both for 
photographic products from sensors such as the Earth Terrain Camera of Skylab and for the return 
beam vidicon (RBV) camera on board Landsats. The latter are similar to television cameras; when 
the camera is shuttered the images are stored on a photo-conducting surface which is then scanned 
initially by an electron beam and an analogue signal derived from the depleted reflection within 
the image tube (Eastman, 1970; Baker and Scott, 1975). In Landsat 3, the RBV cameras were 
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configured to give monochromatic imagery with a narrower total field of view and hence with 
much better resolution than that of the RBV cameras and MSS on board Landsats 1 and 2. The 
EIFOV of the latter derived from laboratory derived curves (General Electric, undated) is about 
39m. The pixel size of Landsat 3 RBV images is rather smaller than this, namely 21.8m X 23.8m 
(RCA, 1 977). As already indicated in Section 2. 1 , such pixel sizes may well give an over-estimate 
of the actual resolution. 

Although the MTF approach provides a much more comprehensive description of system 
resolving power than measures based on geometric properties or the ability to distinguish between 
point targets it has its limitations. It is based on high contrast objects which are long compared 
with their width, and most ground targets do not have this form. Hence measures derived from 
MTF curves may imply an overly-optimistic performance to the unwary. 

2.4 Ability to measure the spectral properties of small finite targets 

The automated classification of images is of increasing importance (Swain and Davis, 1978; 
Townshend, 1981). Such classification is usually highly dependent on the fidelity of the spectral 
measurements that are recorded by the sensor. Consequently resolution measures which indicate 
the minimum size of targets for which the spectral properties can be recorded to a given of accuracy 
are potentially of great value. Colvocoresses (1979b) suggested a measure of resolution called the 
effective resolution element (ERE) based on the size of area for which a single radiance response 
(value) can be assigned with reasonable assurance that the response is within 5% of the value rep- 
resenting the actual relative radiance. He derived values of 86m for the Landsat MSS, 30m for the 
Landsat 3 RBV cameras, and gives an estimated value of 35m for the Thematic Mapper of Landsat- 
D. Subsequently Strome ( 1 979) suggested a refinement of this idea and defined a modified ERE 
based on a single homogeneous area surrounded by a much larger homogeneous one, whose mea- 
sured radiation is 30% of the full-scale of the measuring instrument. The ERE is defined as the 


15 


minimum area for which spectral properties of the centre can be assigned with at least 95% confi- 
dence that the values differ from the actual parameter values by no more than 5% of the full scale 
of the measuring instrument (Strome, 1 979). 

Estimation of this area demands we know the probability distribution of the observed signal 
which IS estimated from the point spread function and system noise. Derivation of such distribu- 
tions for various sensors are provided in Forshaw et. al. (1980) but as yet no direct quantitative 
estimates of this potentially very useful measure of spatial resolution using this method have been 
made. We can gam an indication of the size of such estimates by reference to earlier work of Nor- 
wood ( 1 974), who modelled the performance of Landsat multispectral scanners using MTF, system 
noise and accuracy of radiometric calibration. Graphical results (Norwood, 1974, Table 5) indicate 
that for typical agricultural scenes there will be a 5% error in radiance values for the Landsat MSS, 
channels 4 & 6, when field size is approximately 125m and 200m respectively. Norwood (1974) 
points out that the error for small field dimensions will tend to be dominated by the MTF error, 
and for larger fields diminishes to asymptote dependent on noise and calibration error. 

If our major concern is classification of images then it would be useful to know what is the 
minimum area occupied by a particular cover type that can be classified to a certain degree of 
accuracy with a given probability. This is discussed further in Section 3. 

-•5 Implications for users 

It is clear that no single definition of spatial resolution is possible, since different users are 
concerned with different image properties and these demand alternative measures of resolution. 
Very different estimates of the resolution may therefore be obtained for the same sensor as can 
be seen in Table 2 which summarizes the various estimates of the resolution of the Landsat MSS. 
Thus those particularly concerned with spectral properties as is often the case for those mapping 
land cover by computer-assisted methods should find measures like the ERE most appropriate. 

On the other hand if analysis is based primarily on traditional visual photo-interpretation methods 
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Table 2. Estimates of the resolving power of the Landsat MSS. 


Resolution Measure 

Source 


Resolution 

(meters) 

1. IFOV — geometric 

NASA 1972 


79 

2. IFOV — geometric 

Slater, 1979 


76.2 

3. IFOV — geometric 

Colvocoresses, 1979 


73.4 

4. IFOV — geometric (min. altitude) 

Gordon 1980 


76 

5. IFOV — geometric (max altitude) 

Gordon 1980 


81 

6. Pixel size 

General Electric (undated) 


79 X 56 

7. Pixel size — resampled 
(Landsat 3 CCT’s) 

Holken brink, 1978 


57 X 57 

8. IFOV — point spread 

Landgrebe, et al., 1977 


90 

9. EIFOV — half cycle 

Welch, 1977 


66 

10. EIFOV - full cycle 

Welch, 1977 


135 

11. ERE 

Colvocoresses, 1979b 


87 

12. Modified ERE — estimate 
for Channel 4 

Norwood, 1974 


125 

13. Minimum classifiable area 

Shay et al., 1975 
General Electric, 1975 


500 X 350 
320 X 220 
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as is often the case for those making inferences about sub-surface conditions such as hydrological 
or geological ones, then measures based on the MTF should be preferable. In practice a user of 
remotely sensed data may only have an IFOV value available as a measure of spatial resolution. As 
a comparative measure this can be useful, so long as its limitations are clearly understood. Thus 
the smaller IFOV of the Thematic Mapper compared with the MSS should lead to an approximately 
ninefold reduction in the area of detectable targets with the same spectral properties, spatial char- 
acteristics and contrast with their surroundings. It should be apparent however that the IFOVs 
cannot simply be translated into the ground measurements of the smallest detectable objects. 

The latter will depend on many factors, not least of which are the terrain properties themselves 
which are being observed. These will affect the spectral response of objects and their surroundings 
and hence largely contribute to the contrast of objects with their surroundings and thus to the 
ability of a sensor to resolve them. 

3. Benefits of improvements in the resolving power of future satellite systems 

It is important that the effects of improving the spatial resolution of satellite images are under- 
stood. Firstly, the users of such data should be aware of the capabilities of systems due to be 
launched in the near future. Secondly the designers of future systems require this information so 
that sensors with appropriate spatial resolution are eventually launched. The consequences of finer 
resolution on the accuracy of automated classification and on visual interpretation are discussed. 

3.1 Automated classification 

The potential benefits of better spatial resolution have been assessed in several empirical 
simulations using data obtained from sensors in aircraft. The resultant imagery is progressively de- 
graded to assess the effects of decreasing resolution on the usefulness of the images. Most of the 
experiments have carried out such degradation simply by taking regular square blocks of the orig- 
inal aircraft pixels and averaging them to create new pixels (e.g. Clark and Bryant, 1977; Kan et al., 
1975; Thompson et al, 1974). Such simple averaging does not fully take into account possible 
differences between the simulated imagery and the actual imagery from satellite-borne sensors in 
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terms of properties such as their point spread functions and signal-to-noise ratios. More sophisti- 
cated algorithms have therefore been applied by some workers to the original aircraft imagery to 
simulate satellite imagery more closely (e.g. Sadowski and Sarno, 1976; Morgenstern etal., 1976). 
The principal objective of nearly all these experiments has been to assess the effects of changing 
spatial resolution on the classification accuracy of land cover types identified by computer-assisted 
automated methods. The latter basically rely on the application of multivariate statistical pro- 
cedures, the variables usually being the values of the separate spectral bands. Identification of 
classes is eithei a priori ox a posteriori, the two approaches being termed supervised and nonsuper- 
vised in remote sensing parlance (Swain and Davis, 1 978). The usefulness of the images is then 
usually assessed according to the proportion of an area which is correctly classified. 

a. The effects of scene noise 

Results from these experiments have often been the converse of what we might expect since 
overall cl^^s^ification accuracy has been found to improve as resolution is coarsened (Figure 6) 
over the range or resolutions considered. Explanation for this paradoxical conclusion has usually 
been couched in terms of the internal heterogeneities within individual cover types. For example 
an area of woodland when viewed at high resolution will normally be found to be far from spec- 
trally homogeneous due to the presence of what Wiersma and Landgrebe (1978) term scene noise. 
Some areas of crowns will be in shadow, others will be strongly illuminated; openings in the crown 
cover may leveal the herbaceous cover below; the amount of reflective leaf matter and woody 
tissue will vary between trees* As a second example, consider a suburban residential area where 
high resolution imagery will reveal many separate individual cover sub-types, including roofs, road 
surfaces, grass cover, trees and many others. 

Automated classification of either of the previously described cover types may well lead to 
errors if their separate component heterogeneities are resolved. Some errors may be termed arti- 
ficial classification errors. For example, although an area is considered in reality to be completely 
“suburban residential”, classification of small areas of trees as woodland may be regarded as errors 
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% CLASSIFICATION ERROR 



RESOLUTION (IFOV) (METRES) 


Figure 6. Decline in classification errors with coarser resolution principally as result of scene noise. 

Note that errors were calculated in different ways in the studies, so that absolute numeri- 
cal values are not comparable between studies. 
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by users of the resultant :riaps. even rnough trees exist on the ground. Similarly within an area of 
wildland forest separate identification of smail openings in the canopy may be regarded as irrele- 
vant or confusing detail by some users, but as correct by others depending on their scale of in.ter- 
ests. However some errosrs may be genuine. Areas of dark shadow can be misdassified as water; 
crown CO" deuraiion v;il! affect tile spectral response of trees and may lead to trees^being assigned 
to inccrreci cia-ses. Coarser resolutions will tend to reduce scene noise by averaging out these 
interna! ivetcroueneities giving a spectrai response with much lower variance and thus classif i- 
cation accuracy is likely to he inc.mased (Sadowski et. ah, 1977). 

.'vIob;.ssc; i et f i 'vSf i ;mve rocentiy m.cdelleu the performance o!'a muitispectral scanner sn 
relation to variations in scene c 'v: nno’;. They siiowed, using hypothetical though realistic data. 


•lir- 


cn witiiin the scene the slower wii! be tire improvement in classi- 


fication wi::. in . rc.ising iFOV. 


h. 1 he effec;-' of boupd::r:e< 

'^ive msn:t< cyno-ed ; 'ove rive cmiy a partial indication of the effects of changing resoh.ition. 
since d.ev d.: u-r expiic i;.. eoitsider the effects of boundaries betw-een the cover types being 
c;:'.ss;fyd ;■> eo: in.distir.ct ioi; ;c -.''lOse witlun cover types discussed above. Pixels falling across 
such bound nries wdi .-eeora a mixed response from the two cover types. Moreover even pixels 
no; directlv ove.- f:e hoirnciary -vvii; re affected by the adjacent land cover types as indicated by 
values in fee in>' rmnent point sprea.i function. Radiation scattering by tlie atmosphere will cause 
adddi nne! ccntr;hi;r:ons from surroimding terrain. Classification may assign a mixed pixel to one 
of :ts eons’dru.ent classes esnccirdiy a class occupies a large proportion of the pixel. If the pro- 
portions are more equitabh> distributed then the class to which it is assigned becomes more un- 
certain a".d 'i'is Class may even be one which is not found within the pixel at all. To a certain ex- 
tent the -'roaiem ;.f mixed pixels can be overcome by designing algorithms to estimate proportions 
dirceiiv N, iepka a"d iiyde. : '->72, Horowitz et ai.. 1975). Despite such efforts boundary 
nixeis arc ■nh.er.m, vy more difficult to classify and as they increase proportionately, so classification 



accuracies will have a tendency to decline. Thus the spatial frequency and configuration of bound- 
aries and consequently the area and shape of land cover units will affect classification accuracy. 

We can examine this effect by reference to three resolution degradation experiments in which 
the classification accuracies were assessed when boundary pixels were excluded (figure 7a) and for 
the same scenes when the boundary pixels were included (figure 7b). For the woodland types in 
the study of Sadowski and Sarno (1976) it can be seen that as resolution coarsens, classification 
accuracies tend to increase. For the other two studies no marked trend is apparent except for the 
coarsest resolution in test run 1 of the investigation of Morgenstem et al. (1976). 

When the whole scene including boundaries is analyzed, the cover types examined by Morgen- 
stem et al. (1976) nov/ show a consistent decline especially marked for the second traverse. Classi- 
fication accuracies in the study of Thompson et al. (1974) show a marked decline at higher IFOVs. 
Results from the latter experiments are especially informative since they are stratified into field size 
classes. Only the largest field size class shows- an increase in accuracy at the highest IFOV. The 
curve for the results of forest classification still show's an upward trend but it is less steep when 
boundaries are included. 

c. Joint affects of scene noise and boundaries 

From these results w'e may infer that changes in classification accuracy with spatial resolution 
arise from two conflicting trends. Firstly, the variance of spectral response or scene noise will 
decline with coarsening resolution which will usually help improve classification accuracy. The 
strength of improvements will be primarily controlled by the degree of spectral heterogeneity 
within a cover class. Secondly, the proportion of boundary pixels will increase with coarsening 
resolution and this will tend to lower classification accuracies. The form of the curves in Figures 
5 and 6 thus depends on the relative importance of these trends between any two resolutions. 
These trends themselves are largely a function of the spatial properties of the terrain which is 
being observed. Different terrain types with contrasted boundary densities and land cover 
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Figure 7a. Changes in classification accuracy with spatial resolution (IFOV) : 

a. with boundaries excluded. 
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Figure 7b. Changes in classification accuracy with spatial resolution (IFOV): 

b. with boundaries included. 
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heterogeneities will thus demand different spatial resolutions for optimal classification accuracies. 
Improvements in the spatial resolving power wUl make automated classification of land cover 
feasible from satellites in many parts of the Earth, where currently this is impossible because of the 
small size of their land cover units. But, we must also recognize that higher resolution may not 
necessarily improve classification accuracy in all terrain types, and it may be advisable actually to 
degrade higher resolution imagery for some tasks. The latter may be particularly appropriate if 
classification is based on simple per-point classification (i.e. using only the spectral information for 
each pixel and no information from surrounding areas) which is the most common type used at 
present. However the extra information contained within higher resolution images can be exploited 
by using measures of image texture (e.g. Haralick, 1979) or by using contextual algorithms which 
classify a pixel using its relationship to other previously classified pixels (e.g. Swain et al., 1980). 

On the other hand, computation times of such classifiers may be significantly higher than conven- 
tional per-point classifiers. 

Various estimates have been made of the minimum size of area which can be satisfactorily 
classified. Work by General Electric (1975) suggests that an area four by four pixels in size is the 
minimum. However it has been suggested that the number of boundary pixels should be no greater 
than half the total area to be classified and this implies for square areas that the total number of 
pixels should be approximately 60 (Shay et. al., 1975) corresponding to a square whose sides are 
between 7 and 8 pixels long. For Landsats 1 to 3 this means the minimum classifiable area will be 
approximately 320 by 220 meters or 500 by 350 meters depending on which criterion is used 
(taking the across scane overlap into account). For the Thematic Mapper of Landsat-D the corres- 
ponding figures will be approximately 1 20m and 230m and for the multispectral sensor of SPOT 
the values are approximately 80m and 1 50m. 

3.2 Visual interpretation 

Relatively little work has been done on the effects of changing resolution on the visual inter- 
pretation of images from future satellite system.s, though there exists an extensive literature on the 
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human assessment of image quality in general (see Schindler [ 1979] and Bylander [ 1979, Chapter 
3] for recent reviews). One example where tests were performed on degraded aircraft images to 
simulate satellite imagery showed a simple monotonic decline of classification accuracy with resolu- 
tion for four out of five cover types (Lauet and Thaman, 1971). This result is cieariy at variance 
with many investigations of computer-assisted classification. Visual interpretation is more im- 
portant however for those tasks where inferences have to be made about sub-surface conditions as 
in geological or geomorphological or hydrological survey. Recently return beam vidicon (RBV) 
imagery from Landsat 3 have become available with an estimated IFO.V of about 40m (RCA Govern- 
ment Systems Division, 1977). Visual inspection of examples of such imagery suggests that it 
is often significantly more interpretable than Landsat MSS imagery (Justice and Townshend, 1979), 
though its advantages are a function of the spatial terrain frequencies that are present (Townshend 
et al., i 979; Townshend and Justice, 1979). It is clear that improvements beyond ine 30m IFOV 
of the Thematic Mapper of Landsat-D will yield useful data for the visual interpreter. For example 
Merifield and Lamar (] 975) in mapping geological faults found significantly more ve;ue in Skylab 
photography with an IFOV' equivalent of 15 to 20m than in photography with IFOV equivaienr 
of 30-40m. On the basis that visual interpretability increases logarithmically with iincar increases 
in image resolution, Welch (1977) estimated that on average there will be an approximate 40% 
improvement in image interpretability due to the increase in IFOV of the Thematic ?4apper com- 
pared with the MSS of Landsats I, 2, and 3. 

4. Significance of other factors affecting the information content of remotely sensed data 

Many factors other than spatial resolution strongly affect the information content of images. 
There is clear evidence that the new spectral bands chosen for future Earth resource sateilites 
(Table 1) will be significantly better than those of the Landsat muitispectral scanner (Tucker, 1978; 
Podwysocki et al,, 1919). Moreover there are several factors which will have a direct impact on 
the fineness of detail extractable from remotely sensed data. 
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None of these is more significant than the spatial variability of terrain itself, since improve- 
ments in resolving power will only be significant to the extent that the terrain itself has spatial 
frequencies of a size which will be better depicted. Although this point is self-evident to many, if 
not most, users of remote sensing data, quantitative knowledge of the inherent spatial variability 
of terrain attributes is generally poor. The variability of soil properties forms a notable exception 
(e.g. Campbell, 1978; Webster, 1978; Webster and Beckett, 1970; Mitchell etal., 1979) though it 
is not straightforward to convert the results of such studies into a form suitable for estimating the 
capabilities of present and future satellite systems. Estimates of field size distributions have been 
made for several different countries, but the most comprehensive ones have relied principally on 
space images themselves to derive the measurements (e.g. Podwysocki, 1976; General Electric, 
1975) and consequently a significant bias will be introduced for smaller fields. Many local studies 
of the morphometry of landforms have been made, but wide scale generalizations are not easy to 
make from them. Information on the spatial variability of other terrain attributes in a form suitable 
for conversion to resolution requirements is usually much poorer. Clearly there is a need for im- 
proved quantitative information about the spatial variability of terrain attributes if their signifi- 
cance in affecting the usefulness of remote sensing data is to be fully appreciated. 

The amount of information that can be extracted from images is also a function of image pro- 
cessing techniques. If we digitally enhance the edges present in an image for example by convolu- 
tion of the image data with a Laplacian filter (Steiner and Salerno, 1975) and add this image to the 
original then we will usually obtain a resultant image which has a much sharper appearance and 
which visually appears to have better spatial resolution (e.g. Hord, 1977). Application of Fourier 
methods either optically or digitally in which a high-pass filter is used, would also have the effect 
of enhancing the fine detail present in the image. Yet a further alternative is to design an optimum 
restoration filter based on the imaging system point spread function of the specific sensor system 
used. This has been successfully carried out by McGillem et al. (1975) for Landsat 1 imagery. 
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The benefits acquired from such products have to be set against the computation times and costs, 
which can often be substantial. However, even simple contrast stretching can produce a much more 
interpretable product for the human interpreter with much more visible detail (e.g. Lillesand and 
Kiefer, 1 979, 566-7). 

Finally it is relevant to point out that in the future political factors may restrict the availability 
of higher resolution imagery. There is an understandable concern in some countries that high re- 
solution data of their sovereign territory should not be made widely available because of the insight 
such imagery may reveal about their resources. Proposals have been made at the United Nations to 
restrict the availability of high resolution data, subject to the country itself granting permission. 
Although it may be tempting to specify a given spatial resolution value beyond which imaging is 
not allowed, this will inevitably be inequitable since difference in inherent terrain variability itself 
will strongly affect the quality of information that is extractable. The same sensing system may in 
effect be a high resolution system for one terrain type and a low resolution system for another. 

5. Conclusions 

Just as the user of maps needs to know their scale so the user of remotely sensed images needs 
to know their spatial resolution. However depending on how these data are to be used, quite 
different measures of spatial resolution are appropriate. The geometric instantaneous field of view 
that is often quoted has comparative values, but has significant limitations. Other measures have 
been derived, which often give a better indication of the usefulness of the data. Measures based on 
the modulation transfer function have significant advantages for visual interpretation whereas those 
based on the spectral properties of small finite objects are preferable for tasks involving automated 
classification. 

Changing the spatial resolution of satellite images affects the utility of satellite images in ways 
which are far from straightforward. For example the success of automated classification using per- 
point classifiers will be a balance between the importance of scene noise within classes and of the 
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frequency of boundary pixels between classes. With improving resolution, the number of boundary 
pixels will decline, but scene noise will increase: thus classification accuracies might be expected 
to improve and then worsen if a wide enough range of resolutions are considered for the same area. 

There is a tendency for users to demand ever finer spatial resolution from satellite-derived 
data. Such wishes need to be militated by the fact that improved spatial resolution does not neces- 
sarily mean improved classification accuracy. In the future users of high resolution imagery may 
well degrade it initially before attempting classification, in order to achieve more accurate results. 
Moreover improvements in spatial resolution may result in the data flow reaching unmanageable 
proportions both for the agencies producing the data and for the user. 

If the utility of satellite-derived data is to be better understood and if future satellite systems 
are to be designed more effectively, then further research is required in three areas. Firstly, resolu- 
tion measures must be more closely linked to the quality and quantity of information which can be 
extracted from the data. Measures such as the effective resolution element represent a potentially 
important step in achieving this objective. Secondly information on the spatial properties of most 
terrain attributes needs to be significantly improved. Surprisingly Earth scientists have as yet 
largely failed to provide comprehensive quantitative data in a form which is compatible with esti- 
mating objective estimates of resolution requirements. Thirdly, results from these two research 
efforts must be integrated so that the benefits from improvements in resolution can be objectively 
assessed. Benefits from improved resolution need to be expressed in terms of the gains in accuracy 
and precision with which terrain attributes can be described: moreover ultimately the economic 
benefits from incremental improvements in resolution need to be determined. 
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